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ENTERED 



3 <110> APPLICANT: Covacci, Antonio 

4 Bugnoli, Massimo 

5 Telford, John 

6 Macchia, Giovanni 

7 Rappuoli, Rino 

9 <120> TITLE OF INVENTION: Helicobacter Pylori Cytotoxin Proteins Useful For 
10 Vaccines And Diagnostics 

12 <130> FILE REFERENCE: CHIR0315 

qC~> 14 <140> CURRENT APPLICATION NUMBER: US/09/921,157 



v 



C — > 15 <141> CURRENT FILING DATE: 2001-08-02 

17 <150> PRIOR APPLICATION NUMBER: 08/466,662 

18 <151> PRIOR FILING DATE: 1995-06-06 

20 <150> PRIOR APPLICATION NUMBER: 08/256,848 

21 <151> PRIOR FILING DATE: 1994-10-21 

23 <150> PRIOR APPLICATION NUMBER: 09/360,934 

24 <151> PRIOR FILING DATE: 1999-07-26 

26 <150> PRIOR APPLICATION NUMBER: 08/471,491 

27 <151> PRIOR FILING DATE: 1995-06-06 

29 <150> PRIOR APPLICATION NUMBER: PCT/EP93/004 72 

30 <151> PRIOR FILING DATE: 1993-03-02 

32 <150> PRIOR APPLICATION NUMBER: PCT/EP93/00158 

33 <151> PRIOR FILING DATE: 1993-01-25 

35 <150> PRIOR APPLICATION NUMBER: FI 92 A 000052 

36 <151> PRIOR FILING DATE: 1992-03-02 
38 <160> NUMBER OF SEQ ID NOS : 10 
40 <170> SOFTWARE: Patentln Ver. 2.1 

42 <210> SEQ ID NO: 1 

43 <211> LENGTH: 27 
4 4 <212> TYPE: DNA 

45 <213> ORGANISM: Artificial Sequenced' 

47 <220> FEATURE: / 

48 <223> OTHER INFORMATION: Description of Artificial Sequence: Novel ^ 
4 9 Sequence 

51 <400> SEQUENCE: 1 

52 gcaagcttat cgatgtcgac tcgagct 27 

55 <210> SEQ ID NO: 2 

56 <211> LENGTH: 3960 

57 <212> TYPE: DNA 

58 <213> ORGANISM: Helicobacter pylori 

60 <4 00> SEQUENCE: 2 

61 aaaaagaaag gaagaaaatg gaaatacaac aaacacaccg caaaatcaat cgccctctgg 60 

62 tttctctcgc tttagtagga gcattagtca gcatcacacc gcaacaaagt catgccgcct 120 

63 ttttcacaac cgtgatcatt ccagccattg ttgggggtat cgctacaggc accgctgtag 180 

64 gaacggtctc agggcttctt agctgggggc tcaaacaagc cgaagaagcc aataaaaccc 240 

65 cagataaacc cgataaagtt tggcgcattc aagcaggaaa aggctttaat gaattcccta 300 

66 acaaggaata cgacttatac agatcccttt tatccagtaa gattgatgga ggttgggatt 360 

67 gggggaatgc cgctaggcat tattgggtca aaggcgggca acagaataag cttgaagtgg 420 
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68 
69 
70 
71 
72 
73 
74 
75 
76 
77 
78 
79 
80 
81 
82 
83 
84 
85 
86 
87 
88 
89 
90 
91 
92 
93 
94 
95 
96 
97 
98 
99 
100 
101 
102 
103 
104 
105 
106 
107 
108 
109 
110 
111 
112 
113 
114 
115 
116 



atatgaaaga 
atttagatgt 
ttacaagcta 
tctcaattga 
ccagctctac 
aaatttctct 
gtaatgtgtg 
cgataaacac 
aaaacgccgc 
tgtggcaaag 
ccaataatac 
acaaacaaga 
cgcaaaaaac 
acacggttgt 
ggtttaaagc 
atctgtccaa 



cgctgtaggg acttatacct 
caatatgcaa aaagccactt 



taaggatagt 
taattttgta 
ggttttgact 
ttatgatggt 
gatgggccgt 
ttcaaaagta 
tcaagcgggc 



tatcagggct tagaaacttt actggtgggg 
tacgcttggg ccaattcaat ggcaattctt 
ccacgagagt ggatttcaac gctaaaaata 
atcgtgtggg ttctggagcc gggaggaaag 
cagaagggat cactagcgat aaaaacgctg 
gccacgctca atttggcttc aagcagcgtt aaattaatgg 
ttgcaatacg tgggagcgta tttggcccct tcatacagca 
acaggggaag tgaattttaa ccacctcact gttggcgata 



gctgatcgca 
gaaatcaaca 
ttgcaagctt 



attatcgcta 



cgccgggtta aacattatcg 
cccttctcaa agtggtgcta 
gagcagtcaa aataatagta 
agaagttcaa cccacgcaag 
caatatcaac cgcatcaaca 
ttctcttacc accaatgcgg 
tcaagcgagc gggcgctctc ttatagtgga 



aagtgggtgg 
ataccaaaaa 
taaaggtgga 
acaccttaga 
ccactaatgt 



ctcaagatct 
gtaatgtttc 
agcgcctagc 



ccgttgatgg gcctttaaga gtgaataatc 
gcgcgaattt tgagtttaag gctggtacgg 
acgatattag tctgggaaga tttgtgaatt 
aaggtattga tacgggtaat ggtggtttca 
aagtcaatat caacaagctc attacggctt 
ttaatgaatt gattgttaaa accaatggga 
aagatatagg cagtcaatcg cgcatcaata 
ttttctctgg gggtgttaaa tttaaaggtg 
atagcccttg gaattatttt gacgctagaa 
ttgcttttgg acctcaagga agtccttggg 
ccctaggtca aaatgcggt.c atggattata 
atttcatcaa caatcaaggc actatcaact 
taagcgtagg caatgcagca gctatgatgt 
tttacaaacc gctcatcaag attaacagcg 
ttttattgaa agcgaaaatc attggttatg 
gtaatgttaa tctagaagag caattcaaag 
gcatggatac ttgtgtggtg cgaaatactg atgacattaa 
gcgatcaaag catggtgaac aaccctgaca attacaagta 
aaaatatagg gatcagcaaa acagctaatg gctctaaaat 
attctacgcc tactgagaat ggtggcaata ccacaaattt 
atgcacgttc tgccaacaac gcccttgcac 
ctcctaattt agtcgctatc aatcagcatg 
tggctaaccg ctctaaagat attgacacgc 
atctcttaca aaccttattg attgatagcc 
atgctacaag cgctaatgaa atcaccaagc aattgaatac 
acatagccag tttagagcat aaaaccagcg gcttacaaac 
tgattttaaa ttctcgttta gtcaatctct 
tcgccaaacg cttacaagct ttaaaagacc 
aagtgttgta tcaatttgcc cctaaatatg 
ttgggggaac gagcttgaat aatggctcta 

tagacgctta ccttaacggg caagtggaag ccattgtggg cggttttgga 
atagctcttt taataatcgt gcgaactccc ttaactctgg ggccaataac 
gcgtgtatag ccgtattttt gccaaccagc atgaatttga ctttgaagct 



ataaaaagac taatattggc acactggatt 
ctcctccaga aggtggctat aaggataaac 
aaaacgacaa aaatgaaagc gctaaaaacg 
acactcaggt cattaaccca cccaatagtg 
tcattgatgg gccttttgcg ggcggcaaag 
ctaacgctga tggcacgatt agagtgggag 
ctcatttgca tatcggcaaa ggcggtgtca 
aaatctaact gggaatatca 
ctatgctttg gcaggatcaa 
cggcacagcc acttttaata 
tgctcataca gctaatttta 
ttttagtggc gttacagaca 
ggccgttaaa aacttcaaca 
taagtgtggg ggaatatact cattttagcg 
ccgtgcgttt ggaaactggc actaggtcac 
gcgaaaaatt ggttatagat gagttttact 
atattaaaaa tgttgaaatc accaataaac 
gcacatcaaa acttatgttc aataatctaa 
gccaattttc aaatttaacc attcaagggg 
atctggtccg aggtgggaaa gtggcaacct 
ttaataatga tatagacagc gcgaccggat 
cattaaaaat acagaacatg 
tacaggtacc aatggcatta 
cctttataac aacaataacc 
agcatgcggt atggctatcg 
tcttatcggt aaggcatgga 
ttcggtgtat tatttaggca 
acccacaaac accactagca 
aaaacgctcc tttcgctcaa cctagtgcta 
attttggcac tattgaaagc gtgtttgaat 
tttatgctaa ctcaggcgct caaggcaggg 
ttatgccaga aaaatgattg 
ggccactacc actttaaaca 
tttgagcttg agtaatgcga 
caccaaccat attgactcgt 
ttctttagaa 



atgatgcggg 



ccaggagaca 
aaaaattcgc 



aaaaacctac caatgtttgg 
acgcttcatt gtatggcaca 



agcgcggcag 
gctaacgcta 
agcgcgggcg 
agctatggtt 
actaattttg 
caaggggcac 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 ' 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
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117 tagggagcga tcaatcaagc ttgaatttca aaagcgctct attacaagat 

118 gctatcatta cttagcctat agcgctgcaa caagagcgag ctatggttat 

119 tttttaggaa cgctttagtg ttaaaaccaa gcgtgggtgt gagctataac 

120 caaccaactt taaaagcaac agcaccaatc aagtggcttt gaaaaatggc 

121 agcatttatt caacgctagc gctaatgtgg aagcgcgcta ttattatggg 

122 acttctacat gaatgctgga gttttacaag agttcgctca tgttggctct 

123 cgtctttaaa cacctttaaa gtgaatgccg ctcgcaaccc tttaaatacc 

124 tgatgatggg tggggaatta aaattagcta aagaagtgtt tttgaatttg 

125 atttgcacaa t.ttgatttcc aatataggcc atttcgcttc caatttagga 

126 gtttctaaat accgctctta aacccatgct caaagcatgg gtttgaaatc 

129 <210> SEQ ID NO: 3 

130 <211> LENGTH: 1296 

131 <212> TYPE: PRT 

132 <213> ORGANISM: Helicobacter pylori 

134 <400> SEQUENCE: 3 

135 Met Glu lie Gin Gin Thr His Arg Lys lie Asn Arg Pro 

136 1 5 10 

138 Leu Ala Leu Val Gly Ala Leu Val Ser lie Thr Pro Gin 

139 ' 20 25 

141 Ala Ala Phe Phe Thr Thr Val He He Pro Ala He Val 

142 35 40 * 45 

144 Ala Thr Gly Thr Ala Val Gly Thr Val Ser Gly Leu Leu 

145 50 55 60 
14 7 Leu Lys Gin Ala Glu Glu Ala Asn Lys Thr Pro Asp Lys 
148 65 70 75 

150 Val Trp Arg He Gin Ala Gly Lys Gly Phe Asn Glu Phe 

151 85 90 

153 Glu Tyr Asp. Leu Tyr Arg Ser Leu Leu Ser Ser Lys lie 

154 100 105 

156 Trp Asp Trp Gly Asn Ala Ala Arg His Tyr Trp Val Lys 

157 115 120 125 

159 Gin Asn Lys Leu Glu Val Asp Met Lys Asp Ala Val Gly 

160 130 135 140 

162 Leu Ser Gly Leu Arg Asn Phe Thr Gly Gly Asp Leu Asp 

163 145 150 155 

165 Gin Lys Ala Thr Leu Arg Leu Gly Gin Phe Asn Gly Asn 

166 165 170 

168 Ser Tyr Lys Asp Ser Ala Asp Arg Thr Thr Arg Val Asp 

169 , 180 185 

171 Lys Asn He Ser He Asp Asn Phe Val Glu lie. Asn Asn 

172 195 200 205 

174 Ser Gly Ala Gly Arg Lys Ala Ser Ser Thr Val Leu Thr 

175 210 215 220 

177 Ser Glu Gly lie Thr Ser Asp Lys Asn Ala Glu He Ser 

178 225 230 - 235 

180 Gly Ala Thr Leu Asn Leu Ala Ser Ser Ser Val Lys Leu 

181 245 250 

183 Val Trp Met Gly Arg Leu Gin Tyr Val Gly Ala Tyr Leu 

184 260 265 



ttgaatcaaa 
gacttcgcgt 
catttaggtt 
tctagcagtc 
gacacttcat 
aataacgccg 
catgccagag 
ggcgttgttt 
atgaggtata 
ttacaaaaca 



3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 



Leu Val Ser 
15 

Gin Ser His 
30 

Gly Gly lie 

Ser Trp Gly 

Pro Asp Lys 
80 

Pro Asn Lys 
95 

Asp Gly Gly 
110 

Gly Gly Gin 

Thr Tyr Thr 

Val Asn Met 
160 

Ser Phe Thr 

175 
Phe Asn Ala 
190 

Arg Val Gly 

Leu Gin Ala 

Leu Tyr Asp 
240 

Met Gly Asn 

255 
Ala Pro Ser 
270 
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186 


Tvr 

x Jr ■ L 


Ser 


Thr 


He 


Asn 


Thr 


Ser 


Lys 


Val 


Thr 


Glv 


Glu 


Val 


Asn 


Phe 


Asn 


187 






275 










280 










285 








189 


His 


Leu 


Thr 


Val 


Glv 


Asp 


Lys 


Asn 


Ala 


Ala 


Gin 


Ala 


Glv 


He 


He 


Ala 


190 




290 










295 










300 










192 


Asn 


Lys 


Lys 


Thr 


Asn 


He 


Gly 


Thr 


Leu 


Asp 


Leu 


Tro 


Gin 


Ser 


Ala 


Gly 


193 


'305 










310 










315 










320 


195 


Leu 


Asn 


He 


He 


Ala 


Pro 


Pro 


Glu 


Glv 


Glv 


Tvr 


Lys 


Asp 


Lys 


Pro 


Asn 


196 










325 










330 










335 




198 


Asn 


Thr 


Pro 


Ser 


Gin 


Ser 


Gly 


Ala 


Lys 


Asn 


Asp 


Lys 


Asn 


Glu 


Ser 


Ala 


199 








340 










345 










350 






\J J. 


Lys 


Asn 


Asp 


Lys 


Gin 


Glu 


Ser 


Ser 


Gin 


Asn 


Asn 


Ser 


Asn 


Thr 


Gin 


Val 


202 






355 










360 










365 








204 


He 


Asn 


Pro 


Pro 


Asn 


Ser 


Ala 


Gin 


Lys 


Thr 


Glu 


Val 


Gin 


Pro 


Thr 


Gin 


205 




370 










375 










380 










207 


Val 


He 


Asp 


Gly 


Pro 


Phe 


Ala 


Gly 


Gly 


Lys 


Asp 


Thr 


Val 


Val 


Asn 


He 


208 


385 










390 










395 










400 


210 


Asn Arg 


He 


Asn 


Thr 


Asn 


Ala 


Asp 


Gly 


Thr 


He 


Arg 


Val 


Gly 


Gly 


Phe 


211 










405 










410 










415 




213 


Lys 


Ala 


Ser 


Leu 


Thr 


Thr 


Asn 


Ala 


Ala 


His 


Leu 


His 


He 


Gly 


Lys 


Gly 


214 








420 










425 










430 






216 


Gly Val 


Asn 


Leu 


Ser 


Asn 


Gin 


Ala 


Ser 


Gly 


Arg 


Ser 


Leu 


He 


Val 


Glu 


217 






435 










440 










445 








219 


Asn 


Leu 


Thr 


Gly 


Asn 


He 


Thr 


Val 


Asp 


Gly 


Pro 


Leu 


Arg 


Val 


Asn 


Asn 


220 




450 










455 










4 60 










222 


Gin 


Val 


Gly 


Gly' 


Tvr 


Ala 


Leu 


Ala 


Gly 


Ser 


Ser 


Ala 


Asn 


Phe 


Glu 


Phe 


223 


465 










470 










475 










480 


225 


Lys 


Ala 


Gly 


Thr 


Asp 


Thr 


Lys 


Asn 


Gly 


Thr 


Ala 


Thr 


Phe 


Asn 


Asn 


Asp 


226 










485 










490 










495 




228 


He 


Ser 


Leu 


Gly 


Arg 


Phe 


Val 


Asn 


Leu 


Lys 


Val 


Asp 


Ala 


His 


Thr 


Ala 


229 








500 










505 










510 






231 


Asn 


Phe 


Lys 


Gly 


He 


Asp 


Thr 


Glv 


Asn 


Gly 


Glv 


Phe 


Asn 


Thr 


Leu 


Asp 


232 






515 










520 










525 








234 


Phe 


Ser 


Gly Val 


Thr 


Asp 


Lys 


Val 


Asn 


He 


Asn 


Lys 


Leu 


He 


Thr 


Ala 


235 




530 










535 










540 










237 


Ser 


Thr 


Asn 


Val 


Ala 


Val 


Lys 


Asn 


Phe 


Asn 


lie 


Asn 


Glu 


Leu 


He 


Val 


238 


545 










550 










555 










560 


t. 1 u 


Lys 


Thr 


Asn 


Gly 


He 


Ser 


Val 


Gly 


Glu 


Tvr 


Thr 


His 


Phe 


Ser 


Glu 


Asp 


241 










565 










570 










575 




243 


He 


Gly 


Ser 


Gin 


Ser 


Arg 


He 


Asn 


Thr 


Val 


Arg 


Leu 


Glu 


Thr 


Gly 


Thr 


244 








580 










585 










590 






246 


Arg 


Ser 


Leu 


Phe 


Ser 


Gly 


Gly 


Val 


Lys 


Phe 


Lys 


Gly 


Gly 


Glu 


Lys 


Leu 


247 






595 










600 










605 








249 


Val 


He 


Asp 


Glu 


Phe 


Tyr 


Tyr 


Ser 


Pro 


Trp 


Asn 


Tyr 


Phe 


Asp 


Ala 


Arg 


250 




610 










615 










620 










252 


Asn 


He 


Lys 


Asn 


Val 


Glu 


He 


Thr 


Asn 


Lys 


Leu 


Ala 


Phe 


Gly 


Pro 


Gin 


253 


625 










630 










635 










640 


255 


Gly 


Ser 


Pro 


Trp 


Gly 


Thr 


Ser 


Lys 


Leu 


Met 


Phe 


Asn 


Asn 


Leu 


Thr 


Leu 


256 










645 










650 










655 




258 


Gly Gin 


Asn 


Ala 


Val 


Met 


Asp 


Tyr 


Ser 


Gin 


Phe 


Ser 


Asn 


Leu 


Thr 


He 
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259 



660 



665 



670 



261 Gin Gly Asp Phe lie Asn Asn Gin Gly Thr lie Asn Tyr Leu Val Arg 

262 675 680 685 

264 Gly Gly Lys Val Ala Thr Leu Ser Val Gly Asn Ala Ala Ala Met Met 

265 690 695 700 

267 Phe Asn Asn Asp lie Asp Ser Ala Thr Gly Phe Tyr Lys Pro Leu lie 

268 705 710 715 720 

270 Lys lie Asn Ser Ala Gin Asp Leu lie Lys Asn Thr Glu His Val Leu 

271 725 730 735 

273 Leu Lys Ala Lys He He Gly Tyr Gly Asn Val Ser Thr Gly Thr Asn 

274 740 745 750 

276 Gly He Ser Asn Val Asn Leu Glu Glu Gin Phe Lys Glu Arg Leu Ala 

277 755 760 765 

27 9 Leu Tyr Asn Asn Asn Asn Arg Met Asp Thr Cys Val Val Arg Asn Thr 
280 770 775 780 

282 Asp Asp He Lys Ala Cys Gly Met Ala He Gly Asp Gin Ser Met Val 

283 785 790 795 800 

285 Asn Asn Pro Asp Asn Tyr Lys Tyr Leu He Gly Lys Ala Trp Lys Asn 

286 805 810 815 

288 He Gly He Ser Lys Thr Ala Asn Gly Ser Lys He Ser Val Tyr Tyr 

289 820 825 830 

291 Leu Gly Asn Ser Thr Pro Thr Glu Asn Gly Gly Asn Thr Thr Asn Leu 

292 835 840 845 

2 94 Pro Thr Asn Thr Thr Ser Asn Ala Arg Ser Ala Asn Asn Ala Leu Ala 
295 850 855 860 

297 Gin Asn Ala Pro Phe Ala Gin Pro Ser Ala Thr Pro Asn Leu Val Ala 

298 865 870 875 880 

300 lie Asn Gin His Asp Phe Gly Thr He Glu Ser Val Phe Glu Leu Ala 

301 885 890 895 

303 Asn Arg Ser Lys Asp He Asp Thr Leu Tyr Ala Asn Ser Gly Ala Gin 

304 900 905 910 

306 Gly Arg Asp Leu Leu Gin Thr Leu Leu lie Asp Ser His Asp Ala Gly 

307 915 920 925 

309 Tyr Ala Arg Lys Met He Asp Ala Thr Ser Ala Asn Glu He Thr Lys 

310 930 935 940 

312 Gin Leu Asn Thr Ala Thr Thr Thr Leu Asn Asn He Ala Ser Leu Glu 

313 945 950 955 960 

315 His Lys Thr Ser Gly Leu Gin Thr Leu Ser Leu Ser Asn Ala Met He 

316 965 970 975 

318 Leu Asn Ser Arg Leu Val Asn Leu Ser Arg Arg His Thr Asn His lie 

319 980 985 990 

321 Asp Ser Phe Ala Lys Arg Leu Gin Ala Leu Lys Asp Gin Lys Phe Ala 

322 995 1000 1005 

324 Ser Leu Glu Ser Ala Ala Glu Val Leu Tyr Gin Phe Ala Pro Lys Tyr 

325 1010 1015 1020 

327 Glu Lys Pro Thr Asn Val Trp Ala Asn Ala lie Gly Gly Thr Ser Leu 

328 1025 1030 1035 1040 

330 Asn Asn Gly Ser Asn Ala Ser Leu Tyr Gly Thr Ser Ala Gly Val Asp 

331 1045 1050 1055 
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